Search results for "Voice activity detection"

showing 4 items of 4 documents

Towards a Non-Intrusive Context-Aware Speech Quality Model

2020

Understanding how humans judge perceived speech quality while interacting through Voice over Internet Protocol (VoIP) applications in real-time is essential to build a robust and accurate speech quality prediction model. Speech quality is degraded in the presence of background noise reducing the Quality of Experience (QoE). Speech Enhancement (SE) algorithms can improve speech quality in noisy environments. The publicly available NOIZEUS speech corpus contains speech in environmental background noise babble, car, street, and train at two Signal-to-noise ratio (SNRs) 5dB and 10dB. Objective Speech Quality Metrics (OSQM) are used to monitor and measure speech quality for VoIP applications. Th…

Context modelVoice activity detectionNoise measurementComputer scienceSpeech recognitionMean opinion score020206 networking & telecommunicationsSpeech corpus02 engineering and technology01 natural sciencesBackground noiseSpeech enhancement0103 physical sciences0202 electrical engineering electronic engineering information engineeringQuality of experience010301 acoustics2020 31st Irish Signals and Systems Conference (ISSC)

researchProduct

Embedded Knowledge-based Speech Detectors for Real-Time Recognition Tasks

2006

Speech recognition has become common in many application domains, from dictation systems for professional practices to vocal user interfaces for people with disabilities or hands-free system control. However, so far the performance of automatic speech recognition (ASR) systems are comparable to human speech recognition (HSR) only under very strict working conditions, and in general much lower. Incorporating acoustic-phonetic knowledge into ASR design has been proven a viable approach to raise ASR accuracy. Manner of articulation attributes such as vowel, stop, fricative, approximant, nasal, and silence are examples of such knowledge. Neural networks have already been used successfully as de…

Settore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniVoice activity detectionArtificial neural networkDictationbusiness.industryComputer scienceSpeech recognitionSpeech technologycomputer.software_genreSpeech processingManner of articulationSilenceVowelComputer ScienceTelecommunicationsMel-frequency cepstrumArtificial intelligencespeech detectorUser interfacebusinesscomputerNatural language processing

researchProduct

Speech Activity Detection under Adverse Noisy Conditions at Low SNRs

2021

Speech originating from the noisy environments degrades the speech quality and intelligibility, thus reducing the human perceived Quality of Experience (QoE). For example, surveillance using drone during natural catastrophe needs an efficient speech recognition device to recognise the speech of the frozen human in presence of drone noise to save their life. Therefore, it often requires to pre-process the noisy speech in order to reduce the noise artifacts and enhance the speech. This paper detects the speech activity using Voice Activity Detection (VAD). The VAD distinguishes speech activity (speech presence) and speech inactivity (silence/noise) by extracting the speech features and compar…

Speech enhancementEuclidean distanceNoiseVoice activity detectionNoise measurementComputer scienceSpeech recognitionFeature extractionSpectral centroidIntelligibility (communication)2021 6th International Conference on Communication and Electronics Systems (ICCES)

researchProduct

A General Fuzzy-Parsing Scheme for Speech Recognition

1985

In this paper a Speech Recognition Methodology is proposed which is based on the general assumption of ‘fuzzyness’ of both speech-data and knowledge-sources. Besides this general principle, there are other fundamental assumptions which are also the bases of the proposed methodology: ‘Modularity’ in the knowledge organization, ‘Homogeneity’ in the representation of data and knowledge, ‘Passiveness’ of the ‘understanding flow’ (no backtraking or feedback), and ‘Parallelism’ in the recognition activity.

Voice activity detectionParsingComputer scienceSpeech recognitionKnowledge organizationFuzzy subsetSpeaker recognitioncomputer.software_genrecomputerFuzzy logic

researchProduct